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Abstract — We consider the problem of detecting the presence of 
a spatially correlated multichannel signal corrupted by additive 
Gaussian noise (i.i.d across sensors). No prior knowledge is 
assumed about the system parameters such as the noise variance, 
number of sources and correlation among signals. It is well 
known that the GLRT statistics for this composite hypothesis 
testing problem are asymptotically optimal and sensitive to 
variation in system model or its parameter. To address these 
shortcomings we present a few non-parametric statistics which 
are functions of the elements of Bartlett decomposed sample 
covariance matrix. They are designed such that the detection 
performance is immune to the uncertainty in the knowledge of 
noise variance. The analysis presented verifies the invariability of 
threshold value and identifies a few specific scenarios where the 
proposed statistics have better performance compared to GLRT 
statistics. The sensitivity of the statistic to correlation among 
streams, number of sources and sample size at low signal to 
noise ratio are discussed. 

Index Terms — multichannel detection, generalised likelihood 
ratio test, covariance based detection, covariance absolute value 
detection 

I. Introduction 

Detecting the presence of signal affected by channel im- 
pairments and corrupted by additive noise is encountered in 
a variety of array processing applications. The goal is to 
classify the observation into one of the two possibilities (T), 
i.e., signal present/not present. The detection statistic for this 
binary hypothesis testing problem is said to be optimal in 
the Neyman-Pearson (N-P) sense if it maximizes detection 
probability (P^) for a fixed false alarm level (Pf a )- It is well 
known that the likelihood ratio test (LRT) assures optimality 
in the N-P sense if the two distributions are known precisely. 
However, in the case of system with uncertain parameters the 
hypotheses are composite. In such case, the statistic which is 
optimal in the N-P sense irrespective of the value taken by 
the uncertain parameters is called uniformly most powerful 
statistic. There is no straightforward procedure to find it and 
they don't exist in many real life scenarios Q. An alternate 
approach is to obtain the asymptotically optimal detection 
statistic called the generalized likelihood ratio test (GLRT) 
by replacing the uncertain parameters with their maximum 
likelihood estimate (MLE) in the likelihood ratio. 

The MLE of the covariances under the two hypotheses 
for the multichannel signal corrupted by additive Gaussian 
noise observation model is the function of sample covariance 
matrix (SCM), because it is the sufficient statistic |3). A few 



variations in the final form of the GLRT statistic depending on 
the prior information or assumption about the system model 
and its parameters are discussed in the following paragraph. 

When the noise across sensors are assumed independent 
resulting in diagonal covariance structure GLRT reduces to 
coherence ratio test JT] and under i.i.d assumption resulting 
in homogeneous diagonal covariance structure GLRT reduces 
to sphericity test J4J. In addition, the information about the 
number of sources (N t ) contributes in improving the detection 
strategy by estimating the noise from the latent roots of the 
model. This method is advantageous when Nt is less than 
the number of sensors, JV r . Assuming the exact knowledge of 
N t , the GLRT reduces to reduced sphericity test which is 
interpreted as a measure of sphericity of the sample covariance 
space to the noise subspace. If there is only a single source and 
noise variance (ct 2 ) is assumed to be known, GLRT reduces to 
maximum eigenvalue test, also known as Roy's largest root test 
(RLRT) J6). It can also be used as a non-parametric statistic 
when number of sources is more than one. 

The GLRT statistics are known to be sensitive to the 
prior information or assumption about the system model and 
its parameters. Also, in practical scenarios no information 
regarding the data will be available at the detector and the 
sample size will also be limited. To address the shortcomings 
with these techniques we resort to non-parametric statistics 
which exploit the spatial correlation across sensors similar to 
the ones proposed in and references therein. The covariance 
absolute value (CAV) statistic, proposed in [8| belongs to this 
category which operates directly on the elements of SCM. It 
was proposed for the real system model and the test statistic 
was defined as, 
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(1) 



where ry represents the (i,j) element of SCM, R. 

The CAV statistic is used as an ad hoc measure to identify 
the contribution of off-diagonal elements. Under the null 
hypothesis, the SCM is diagonal due to spatially uncorrelated 
noise. Hence the CAV statistic approaches unity under the 
null hypothesis and is greater than unity under the alternate 
hypothesis due to the existence of correlation either in the 
signalling method or when induced spatially. Due to the 
effectiveness of CAV its performance is used as a reference to 



compare the blind statistics with the GLRT statistics |9l, |5l, 
iflOl . This motivates us to look into other forms of co variance 
based ratios which can outperform the well established CAV. 
Our previous work Hill extends the analysis of CAV to include 
the complex data considering its equivalent form as: 



TABLE I 
Statistics: Type 1, 2, 3 and 4 




(2) 



l<i<j<N., 

Due to the dependent nature of the numerator and the de- 
nominator terms, the analysis of the statistic is cumbersome. 
Therefore a statistic similar to CAV is formulated using 
the elements of Q, the Bartlett decomposed SCM , where 
R = QQ H 




(3) 



Ti(R)= £ 

l<j<i<N, 

The Bartlett decomposition makes the elements of lower 
triangular Q matrix independent. They have the following 
distributional property under Mq l3l . 
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XN-i+l, 



1< i < N r 



- — qij ~ CAf(0, 1), 1 < j < i < N r 
a 

where Xfc denotes a chi-random variable with k degrees of 

freedom (d.o.f). The independency between the numerator 

and denominator terms makes the analysis of the statistics 

formulated using the elements of Q simpler compared to the 

ones which use the dependent elements of R. 

This motivates us to look into more possibilities of forming 
statistics with the elements of Q. The analysis presented in 
this paper considers non-parametric statistics on complex data 
which are designed as functions of elements of Q in the form 
of ratios similar to CAV and their combination. We show 
that the combined statistics are robust against uncertainties in 
the value of noise variance and correlation. Moreover, a few 
scenarios were identified under which these statistics exploit 
the correlation property better than the blind GLRT statistics 
and the CAV statistic resulting in improved performance. 

Section|lI]presents problem formulation, followed by analy- 
sis and observations about a number of non-parametric statis- 
tics in Sectionini] Based on the analysis, we propose in section 
[IV]combining these statistics leading to improved performance 
and less sensitivity to variation in system parameters. 

II. Problem Formulation 

Observation X G C N, ' xN represents a block of N samples 
across N r sensors, giving rise to the two hypotheses model: 

Hs + r] , Signal hypothesis M\ 
r] , Null hypothesis 3%q 

where r)EC NrXN is the additive noise, which is assumed to be 
zero-mean circular complex Gaussian, spatially uncorrected 
and temporally white with the covariance matrix a 2 lN r - The 
signal transmitted from the N t number of sources (assuming 
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N t <N r ) is temporally uncorrected and assumed to be i.i.d 
standard complex Gaussian vector, i.e, s 6 C NtXN ^CJV(0, 1). 
The channel present between the source and the sensor is 
assumed constant over the observation duration and modeled 
as correlated complex Gaussian matrix, i.e., H e QN T xN t 
wherein each hij ~ CAf(0, 1/Nt). The channel matrix H 
is modeled to capture the correlation that might be present 
between sensors, and one that is introduced by the channel. 
The correlation present in the model is assumed unknown. The 
SCM is calculated as R=XX H /iV. 

Our aim is to design a statistic for the classification without 
having any information about the system parameters. We 
consider non-parametric statistics shown in table [[] whose 
formulations are similar to the CAV statistic in (J2}. They are 
called as Type 1, 2, 3 and 4 and are functions of Q. 

III. Analysis of test statistics: Type 1, 2, 3 and 4 

The exact closed form expression for the distribution of 
the statistic under the null hypothesis is required to find 
the detection threshold. Approximations are used when exact 
closed form expressions are not available and are intractable. 

A. Threshold Calculations 

1) Type 1: For the T\ statistic defined in Table U note that, 
|<jtjj-| ~ Rayleigh (1/V2). The distribution of sum of these 
Nu=N r (N r — l)/2 independent Rayleigh random variables 
can be calculated as in |fl~2). However, we approximate the 
sum distribution by the Gaussian tail approximation with the 
following parameters. 
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Since straightforward simulation show that the Gaussian tail 
approximation is more accurate than [12| for the above case, 
the threshold calculated with this approximation method is 
used to evaluate Pd in Figfl] The denominator is trace of Q, 
which is approximated to its mean value p x (see Appendix 
lAl under moderately large sample size assumption. Therefore, 
the parameters of the Gaussian distribution in (|5) are scaled 
by l//i x and the threshold for 2\ for a given Pf a is given by, 

7th=0WM x ) + {o-%/^)Q- 1 {Pf & ) 



Q denotes the tail probability of a Gaussian distribution. 

2) Type 2: Using similar arguments, T^ can be shown to be 
a scaled Rayleigh with parameter -JNr/2/ p, x . The Rayleigh 
CDF with this parameter is used to calculate the threshold. 

3) Type 3: Note that the numerator of T% is the square 
of the numerator of T\ with approximate distribution given 
in (0. After squaring and scaling with 1/c 2 ^, it transforms 
to non-central \ 2 with 1 d.o.f and non-centrality parameter 
5=(pn/&n) 2 - The denominator is \ 2 with d = (N + l)N r — 
N r (N r + l)/2 d.o.f. When scaled properly the ratio follows 
Fx t d(S) distribution, i.e., 
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»N *d/ d 

Similar statistic is proposed in IfTJI for the real system model. 
4) Type 4: When the numerator of T4 is scaled with 
(N r (N r - I)) -1 , it follows x 2 distribution with 2 d.o.f. The 
denominator is similar to (|6). Hence, 

xl/2 
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F- 
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(7) 



2N r {N r -l) — x 2 Jd 

B. Simulation set-up and Observations 

We consider N r =6 and Nt—A. lfT4l and (6) among others 
assume large N (N ^> 100), we however assume moderate 
N (N ps 100) similar to |5j and |15|. Since we consider the 
blind detection problem we assume that the structure of the 
covariance matrix under the signal hypothesis is not known 
at the detector. If the detector were not blind and knows the 
covariance structure, then the test statistic could be formed 
to exploit it. We restrict our focus to the blind detection 
and impose AR(1) spatial covariance structure lTT6l on the 
multichannel signal for the simulation purpose (for comparison 
of different statistics ability to exploit correlation) such that, 

X = P 1/2 Hs + r7 

P 1 / 2 !! corresponds to H in ©, where elements of H are in- 
dependent and hij ~ CAf(0, 1/Nt). P captures the correlation 
present in the system and P 1 / 2 is its Cholesky decomposition. 



P 

P 
1 



and H = P 1/2 H 



(8) 



The average received signal to noise ratio across each sensor 
is given by, 



SNR 



£[tr(vv H )] 



where v 



P^Hs 



EMvv H )} 

For each SNR, the distribution of the statistics Ti, T 2 , T 3 
and T4 under the two hypotheses are obtained through 1000 
Monte-Carlo realizations. The detection threshold is found 
using the null distribution of the statistics keeping a fixed 
constraint on the value of Pf a (=0.1). The performance (Pd) 
evaluated using this threshold is denoted as simulation whereas 
Pd obtained with the thresholds found in section IIII-AI is 



denoted as Approximation in Fig. \T\ To verify the accuracy 
of approximations used in deriving the threshold, the Pf a is 
calculated using the null distribution of the statistic obtained 
through Monte-Carlo simulation and plotted along with Pd- 

The calculations in section IIII-AI indicate that the detection 
thresholds are independent of noise variance and Fig.[T]verifies 
the accuracy of these calculations in maintaining Pf a at the 
preset value 0.1. It also verifies the validity of threshold at 
sample size N = 100. In terms of performance we observe 
from Fig. Q] that T\ and T3 are equivalent and T^ and T4 
are equivalent. Note that both T? and T4 have cross terms 
(refer (fl4] i in Appendix |Bj making them highly sensitive to 
correlation among streams. 

It would be desirable to have the performance independent 
of the correlation p because correlation p is not known. How- 
ever, we desire to retain the good features of T\ (or T3) under 
low correlation and that of T2 (or T4) under high correlation. 
We now propose the combination statistics, T\% = T\ + T2 
and T 34 = T 3 +Ti such that in T 12 (or T 34 ), 2i (or T 3 ) will 
dominate under low correlation and T^ (or T4) will dominate 
under high correlation. 

IV. Combination of statistic 
A. T12 statistic 

When the denominator is replaced with the mean value p, x , 
the combination statistic effectively has mean and variance 
(from the similar arguments used in Type 1 and Type 2 
threshold calculations) given as, 
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The detection threshold using Gaussian tail approximation is, 

7th = M12 + <712<9 _1 (P&) 

B. T34 Statistic 

From ©, the scale dja 2 N makes T3 follow Fi,<j(d). Since 
the scaling should be same for both terms, T4 after scaling is, 
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(10) 



The numerator of (TTOt is Rayleigh random variable with 
parameter \jNn/2/aN- When squared, it is distributed as 
exponential random variable with parameter A=2/(2 — 7r/2) 
which is independent of system parameters N r and N. There- 
fore the ratio in ([Tot is scaled F24 distribution with factor A. 
Effectively the distribution of scaled T34 is written as sum of 
two correlated F distributions (central and non-central), i.e., 



T M ~F 14 {5) + AF 2A = A + B 
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'N 



where = denotes term wise equivalence. The approximate 
correlation between the two terms is given in Appendix |B] 
If (pa, & 2 a) an d (fJ-B, &%) are mean and variance of A and B, 



6x4 , N=100 , P =0.1 , p=0.3 



6x4 , N=100 , P =0.1 , p=0.7 



1 

0.9 




1 1 M— ' 

T i andT 3 eS^ 


9 * Si 1 

41 0.9 


! ! ^-^r—— ~ 


| » « 


0.8 




\/ 




0.8 
0.7 


T 2 andT 4 _^Y 


- 


0.7 


, T (Simulation) 


T (Simulation) 








_*_T (Approximation) 






^ T (Approximation) 


11.6 






_B-T, (Simulation) 


0.6 




D T (Simulation) 


0.5 


P „ 




rv T (Approximation) 


™ 0.5 




':-Jr*. 


r^T (Approximation) 


0.4 






y/^i"'" 1 , 


q T (Simulation) 

» T (Approximation) 


0. 

0.4 




Jr T ( and T 3 


T (Simulation) 

,-, T (Approximation) 


0.3 






g T (Simulation) 


0.3 




_g_T (Simulation) 


0.2 




P f« 

0" i 


a T (Approximation) 


0.2 
' 0.11 

1 


- P f B 

» * i . 


a T (Approximation) 








-1 












2 -10 




-8 -6 -4 
SNR 


2 


2 


-10 -8 -6 -4 
SNR 


2 2 



(a) 



(b) 



Fig. 1. P d vs. SNR for K^p = 0.3 andTOp = 0.7 



the 7th can be calculated using Gaussian tail approximation, 

i.e., 

i-i/ 



N =6 , N=100 , P =0.1 , SNR=-10 dB 



7th = M34 + 034 <9 (P fa ) 



(12) 



where ^34 = fj, A + p B and cr| 4 = <J 2 A + a B + ^Pab <Ja<Jb- 
V. Simulation results and Discussion 

We compare the performance of X12 and T34 with the CAV 
statistic and blind GLRT statistics, such as coherence ratio 
test and sphericity test. Also, we consider reduced sphericity 
test and RLRT which assume complete knowledge about the 
parameters N t and a 2 respectively. This will enable us to 
know the loss in performance of the blind statistics for not 
knowing these parameters. Also, we analyse the sensitivity of 
the statistics to variation in system parameters at very low 
SNR (-10 dB). The RLRT statistic is omitted in sensitivity 
comparison because it is less sensitive to variation in p and Nt. 
The simulation set-up is similar to IIII-BI and chosen system 
parameters are indicated in each figure. 

A. P d vs. SNR 

The performance of the statistics at different SNR under 
low correlation (p = 0.3) and high correlation (p = 0.7) is 
plotted in Fig. |2(a)| and |2(b)| We observe that the performance 
of the T34 statistic is equivalent to the coherence ratio statistic 
(blind GLRT statistic) under low correlation and has a better 
performance compared to blind GLRT statistics under high 
correlation. T\% is advantageous compared to T34 under high 
correlation, however, performs poorly under low correlation. 
The complete knowledge about the noise variance makes the 
RLRT statistic perform better than blind statistics. The loss in 
performance due to lack of knowledge about the noise variance 
is significant under low correlation. Under high correlation the 
combination statistics reduce this loss by exploiting the spatial 
correlation. Moreover, the combination statistics exploit the 
correlation property better than the CAV statistic. 

The overshoot in Pf a indicates the effect of underestimation 
of the threshold. It is because the variance in (0 is calculated 
neglecting the correlation between T\ and T%. The Gaussian 
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Fig. 3. P d vs. p for N t = 1 and Nt = 4 



tail approximation is accurate for the T34 statistic. It verifies 
the validity of threshold under low sample sizes (TV = 100) 
and also the robustness against the uncertainty in the value of 
noise variance. 

B. Correlation among streams (p) 

It is expected that the detection performance should increase 
due to deviation in the observation's spherical structure as 
correlation among streams increases. The Fig. [5] depicts the 
effect of variation in the value of correlation on the perfor- 
mance of the statistics for a fixed number of sources in the 
system. If there are more than one source (N t >l), increase in 
correlation improves the performance of T12 and T34 statistic 
and this improvement is significant compared to the blind 
GLRT statistics. This shows that the combination statistics 
exploit the correlation property better than the blind GLRT 
statistics. However, when there exists only a single source 
(N t =l, rank-1 channel), the correlation among channels in 
worse conditions results in decrease in performance {Pd) with 
increase in correlation p. The combination statistics T12 and 
T34 perform poorly under this condition. 
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Fig. 2. P d vs SNR forKaflp = 0.3 andKFflp = 0.7 



C. Number of sources (N t ) 

The performance of the statistics (Pd) decreases with in- 
crease in the number of sources ifTTI . This is due to the 
alignment of dominant right singular vectors of the channel 
in the statistical direction of the transmit covariance matrix, 
which is well known in MIMO literature by the name channel 
hardening effect (YD- 

The effect of variation in the number of sources on the 
performance of the statistics under low and high correlation is 
plotted in Fig. |4(a)| Under low correlation T\% performs poorer 
than the blind GLRT statistics, however, it outperform all the 
other statistics (including X34) under high correlation. The 
performance T34 statistic is equivalent to blind GLRT statistics 
under low correlation and performs better than blind GLRT 
statistics under high correlation. The combination statistics are 
almost invariant to variation in Nt under high correlation. 

D. Sample size (N) 

The performance with variation in sample size (N) fixing 
the other two parameters Nt and p is plotted in Fig. |4(b)| 
As expected, the detection probability for all the statistics 
approaches 1 as the sample size increases. The T\% and X34 
statistics perform better than blind GLRT statistics under high 
correlation for all sample sizes. When low correlation scenario 
is considered T\i performs poorer than GLRT statistics, how- 
ever, T34 statistic is equivalent to the blind GLRT statistics. 
Therefore, T34 is the best choice if the statistic has to perform 
equally well under both high and low correlation scenario. 

VI. Conclusion 

The performance improvement for the considered multi- 
channel detection problem, compared to sensitive and asymp- 
totically optimal GLRT statistics, is achieved through combin- 
ing the non-parametric statistics. The threshold calculations 
verifies the independent nature of the detection thresholds on 
the value of noise variance making the statistics robust to 
uncertainty in them. The Monte-Carlo simulation verifies it 
and also validates the approximation techniques used. 



Under high correlation the proposed combination statistics 
have better performance compared to blind GLRT statistics 
and the CAV statistic from which all the designed statistics 
are motivated. Also, they are insensitive to variation in Nt 
and have better performance at low sample sizes. Under low 
correlation, the performance of X34 is equivalent to blind 
GLRT statistics, however, performance of T12 is poorer than 
blind GLRT statistics. Therefore, if the statistic has to be 
chosen independent of correlation, T34 would be a better 
choice. The only scenario where the combination statistics fail 
is when there exists only a single source (N t =l) in the system. 
In such scenario, both T12 and T34 perform worse compared to 
blind GLRT statistics and CAV statistic. Extending the analysis 
to more general correlation model opens up many possibilities 
for future work. 

Appendix 
A. Approximation for the trace of Q 

When k is large, the mean of \k random variable is, 

' r(fc/2) V 4fc 

The variance of Xk random variable, (k — /1 2 ) is far less 
compared to its mean p when k is moderately large. This 
is shown here. 



/'" 



*-*(!-&) 



]_\2 

4 A- •' 



1 



<1 



/i /i 2/i 

Hence, the trace of Q is replaced with its mean given by, 
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B. Approximate correlation coefficient 

912, 913 ■ ■ • qiN r , 923, 924 •• ■ 9AT r -ijv r be represented as 
&i + jbi, a,2 + j&2 ■ • ■ a N R + j bN R - Note that they are all 



N =6 , N=100 , P =0.1 , SNR=-10 dB 
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Fig. 4. P d vs N t for p = 0.3 and 0.7,Rb)|P d vs log N for p = 0.3 and 0.7 



independent. Let F and Z be numerator of T3 and T4 
respectively, then 
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The cross term is calculated as, 
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E[YZ}= N R [2 + (N R - 1)(1 + 0.257r(7V fl - 2) + 
The mean and variance of F, Z are calculated as, 



/?rr2.5)] 



(my=Nr + 0.25%N R (N R - 1), cr^ = 2 + 45 
Hz=N R , 



N 2 R 



where (5 is the non-centrality parameter defined in ©. pyz is 
calculated using these parameters and pab ~ Prz- 
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